Detecting geospatial location descriptions in natural language text
نویسندگان
چکیده
References to geographic locations are common in text data sources including social media and web pages. They take different forms from simple place names relative expressions that describe location through a spatial relationship reference object (e.g. the house beside Waikato River). Often complex, multi-word phrases employed road railway cross at right angles; line with canal) where relationships communicated various parts of speech prepositions, verbs, adverbs adjectives. We address problem automatically detecting geospatial descriptions, which we define as those include relation terms referencing objects, distinguishing them non-geographical descriptions book on table). experiment several methods for automated classification expressions, using features machine learning bag words detect distinctive words, word embeddings encode meanings manually identified language patterns characterise expressions. Using three sets created this study, find ensemble meta-classifier approaches, variously combine predictions other classifiers features, provide best F-measure 0.90
منابع مشابه
Mining Geospatial Path Data from Natural Language Descriptions
In this paper, we describe the TEGUS system for mining geospatial path data from natural language descriptions. TEGUS uses natural language processing, GIS entity databases, and graph-based path finding to predict lat/lon paths based only on natural language text input. We also report on preliminary results from experiments on a corpus of path descriptions.
متن کاملRevealing Geospatial Semantics in Text Descriptions
This paper seeks to reveal the semantics of geospatial concepts as it can be elicited from text descriptions. So far, such an approach has dealt with the analysis of existent repositories of geographic information. Taking up with the study of the semantics of geospatial concepts, we turn to the analysis of the linguistic expressions of space and consequently of the “things” it includes. More sp...
متن کاملGenerating Natural-Language Video Descriptions Using Text-Mined Knowledge
We present a holistic data-driven technique that generates natural-language descriptions for videos. We combine the output of state-of-the-art object and activity detectors with “real-world” knowledge to select the most probable subject-verb-object triplet for describing a video. We show that this knowledge, automatically mined from web-scale text corpora, enhances the triplet selection algorit...
متن کاملDetecting conditional healthiness of food items from natural language text
In this article, we explore the feasibility of extracting suitable and unsuitable food items for particular health conditions from natural language text. We refer to this task as conditional healthiness classification. For that purpose, we annotate a corpus extracted from forum entries of a food-related website. We identify different relation types that hold between food items and health condit...
متن کاملConstraint Detection in Natural Language Problem Descriptions
Modeling in constraint programming is a hard task that requires considerable expertise. Automated model reformulation aims at assisting a naive user in modeling constraint problems. In this context, formal specification languages have been devised to express constraint problems in a manner similar to natural yet rigorous specifications that use a mixture of natural language and discrete mathema...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: International Journal of Geographical Information Science
سال: 2021
ISSN: ['1365-8824', '1365-8816']
DOI: https://doi.org/10.1080/13658816.2021.1987441